Reality Checks for a Distributional Assumption: The Case of “Benford’s Law”
نویسنده
چکیده
In recent years, many articles have promoted uses for “Benford’s Law,” claimed to identify a nearly ubiquitous distribution pattern for the frequencies of first digits of numbers in many data sets. Detecting fraud in financial and scientific data is a suggested application. Like the Normal and Chi-square distributions, Benford’s appears to offer an appealingly clear-cut, mathematically tractable, and widely applicable tool. However, similar to those other models, writers may “assume” the model meets all the assumptions needed for hypothesis testing, without properly examining whether those conditions hold. This paper examines a diverse set of real-world data sets to demonstrate that while Benford’s-like patterns are indeed common, Benford's per se is not a unique and universal template for all cases of interest to fraud investigators. This reminds us of how, in general, distributional assumptions can sometimes be overlooked or fail to be critically questioned.
منابع مشابه
Application of Benford’s Law in Analyzing Geotechnical Data
Benford’s law predicts the frequency of the first digit of numbers met in a wide range of naturally occurring phenomena. In data sets, following Benford’s law, numbers are started with a small leading digit more often than those with a large leading digit. This law can be used as a tool for detecting fraud and abnormally in the number sets and any fabricated number sets. This can be used as an ...
متن کاملForensic Data Pattern Analysis using Information Entropy
We take a data mining approach to characterize distributional patterns of financial/accounting data. We especially analyze underlying information uncertainty in heterogeneous forms of records using the measure of ‘entropy’. In today’s accounting information systems, effective detection of transaction frauds become a key component for the success of e-commerce systems. A popular method in the cu...
متن کاملSurvival Distributions Satisfying Benford’s Law
Hill stated that “An interesting open problem is to determine which common distributions (or mixtures thereof) satisfy Benford’s law . . .”. This article quantifies compliance with Benford’s law for several popular survival distributions. The traditional analysis of Benford’s law considers its applicability to datasets. This article switches the emphasis to probability distributions that obey B...
متن کاملSensitivity to statistical regularities: People (largely) follow Benford’s law
Recent decision making research has emphasized people’s sensitivity to statistical relationships in the environment. A little-known relationship is Benford’s law, that the first digits of numbers representing many natural and human phenomena have a logarithmic distribution (Benford, 1938). Benford’s law is being used to help detect fraudulent financial data, but this assumes that people will no...
متن کاملBenford’s Law
Benford’s Law predicts the frequency of the leading digit in numbers met in a wide range of naturally occurring phenomena. In data following Benford’s Law, numbers start with a small leading digit more often those with a large leading digit. Here we demonstrate that Benford’s Law also describes a wide range of computational phenomena. In particular, we show that a number of different statistics...
متن کامل